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1. Introduction 


This document describes the window and one-sigma depth functions relevant to the 
Transiting Planet Search (TPS) algorithm in the Kepler pipeline (Jenkins 2002; Jenkins et 
al. 2017). The window function specifies the fraction of unique orbital ephemeris epochs 
over which three transits are observable as a function of orbital period. In this context, 
the epoch and orbital period, together, comprise the ephemeris of an orbiting companion, 
and ephemerides with the same period are considered equivalent if their epochs differ by 
an integer multiple of the period. The one-sigma depth function specifies the depth of a 
signal (in ppm) for a given light curve that results in a one-sigma detection of a transit 
signature as a function of orbital period when averaged over all unique orbital 
ephemerides. 


These planet detection metrics quantify the ability of TPS to detect a transiting planet 
signature on a star-by-star basis. They are uniquely applicable to a specific Kepler data 
release, since they are dependent on the details of the light curves searched and the 
functionality of the TPS algorithm used to perform the search. This document describes 
the window and one-sigma depth functions relevant to Kepler Data Release 25 (DR25), 
where the data were processed (Thompson et al. 2016) and searched (Twicken et al. 
2016) with the SOC 9.3 pipeline. In Section 4, we describe significant differences from 
those reported in Kepler Data Release 24 (Burke & Seader 2016) and document our 
verification method. 


These data products are archived at the NExScl Exoplanet Archive’ as two separate FITS 
files per target as described below. They are available for all targets searched for planets 
by the Kepler pipeline (Twicken et al. 2016) and can be downloaded as a set, or for 
individual targets through the interactive interface. 


' http://exoplanetarchive.ipac.caltech.edu/index.html 
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2. Window Function 


The window function for a transit survey specifies the fraction of unique orbital 
ephemeris epochs that permits = k transits to be observed as a function of period for a 
fixed-length data set (Gaudi 2000; Burke et al. 2006; Burke & McCullough 2014). In 
this context, ephemerides with the same period are considered identical if their epochs 
differ by an integer multiple of the period. The Kepler pipeline requires k = 3 transits for 
planet detection at each of the fourteen transit durations searched (i.e., 1.5, 2.0, 2.5, 3.0, 
3.5, 4.5, 5.0, 6.0, 7.5, 9.0, 10.5, 12.0, 12.5, and 15.0 hrs), so the reported window 
functions are based on this three-transit criteria. An additional check is performed by 
TPS when exactly three transits contribute to a putative detection. In the three-transit 
case only, each transit is inspected. If less than half of the in-transit cadences have full 
weight (e.g., full weight, w=1.0, and full deweighting, w=0.0) for any of the three transits 
that contribute to the detection, then the detection is rejected and does not become a 
Threshold Crossing Event (TCE). The three-transit weight check was not included in the 
DR24 calculation of the window function (Burke & Seader 2016). In Section 4, we 
discuss the impact of including this additional check. 


The FITS files are named kpIr<identifier>_<release>_window.fits, where <identifier> 
corresponds to the target's Kepler Input Catalog (KIC) identification number and 
<release> specifies the Kepler data release. Each file contains 14 binary table extensions 
for the 14 pulse durations searched by TPS. The transit duration is specified by the 
keyword TDUR in the FITS header for each extension. Each binary table extension has 
two columns corresponding to the orbital period (in days) and the window function 
(unitless) for the stated transit duration. In order to evaluate the window function at an 
arbitrary orbital period, it is up to the end user to interpolate between the tabulated orbital 
periods. However, in practice the resolution in orbital period provided is sufficient for 
‘nearest neighbor’ interpolation in most use cases. 


Due to the expected increase in transit duration with orbital period for transiting planet 
signatures, the entire period range is not searched at all transit durations. The shortest 
transit durations have an upper limit to the orbital periods searched, and the longest 
transit durations have a lower limit to the orbital periods searched. Thus, the range of 
orbital period over which valid window functions are computed by TPS varies with 
duration. Section 2.2 of Tenenbaum et al. (2014) describes details of the orbital period 
range searched as a function of transit duration. The orbital period range accommodates 
the expected transit durations assuming circular orbits with extended coverage to allow 
for moderately eccentric orbits. 


Several example window functions are provided in the left-hand panels of Figures | — 3. 
These examples illustrate targets that have varying amounts of observational data 
available. The window function ranges from 0 to 1. It has a value of one at short 
periods, where there is a high probability of having at least three transits. The window 
function then decays to zero at long orbital periods, since there is a decreasing probability 
that the flux time series is long enough to allow at least three transits to be observed. For 
nearly all Kepler targets, the window function varies insignificantly with transit duration, 
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so it is typically valid to adopt the results of any small set of durations that spans the full 
range of periods. This is well illustrated in the left-hand panels of Figures 1 — 3, where 
all 14 durations are plotted, but only one continuous curve is apparent. 
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3. One-Sigma Depth Function 


The detectability of a transit signal is predominately determined by its signal-to-noise 
ratio (SNR) (Gaudi 2000). One can estimate the transit SNR as A VNran/Ocdpp, Where A is 
the expected transit depth, Niran is the expected number of transits, and Ocapp 18 an 
averaged noise estimate from the root-mean-square (rms) CDPP (Christiansen et al. 
2012). However, this SNR estimate assumes all observations have the same noise 
properties and that missing or de-weighted data in the flux time series are evenly 
distributed in time. For the one-sigma depth function, TPS quantifies the transit signal 
depth that yields a SNR of unity as a function of the orbital period by taking into account 
the full details of the time varying properties of the noise and accounting for any missing 
or de-weighted data. When using the one-sigma depth function, Aone, the transit signal- 
to-noise ratio is simply SNR=A/Aone. 


As for the window function, the one-sigma depth function is archived in FITS format. 
Each file contains 14 binary table extensions giving results for the 14 transit durations 
searched by TPS. Each binary table extension has two columns corresponding to the 
orbital period (in days) and the one-sigma depth (in ppm). The file names have the form 
kpIr<identifier>_<release>_onesigdepth.fits, where <identifier> and <release> have the 
same definitions as for the window function files. In order to evaluate the one-sigma 
depth function at an arbitrary orbital period, it is up to the end user to interpolate between 
the tabulated orbital periods. However, in practice the resolution in orbital period 
provided is sufficient for ‘nearest neighbor’ interpolation in most use cases. 


Several examples of one-sigma depth functions are provided in the right-hand panels of 
Figures 1 - 3. These examples feature targets with varying amounts of missing data to 
illustrate the impact of incompleteness. In the large Miran (i.e., short-period) regime, Aone 
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Figure 1. Example planet detection metrics for a target with data available in every 
quarter (Q1-Q17). This target corresponds to KIC 10593626, the host of Kepler-22b 
(Borucki et al. 2012). Left: Window function as a function of orbital period. Right: 
One-sigma depth function as a function of orbital period. The top (bottom) line 
corresponds to the shortest (longest) transit duration searched for transit signatures. 
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increases as VPo.», Where Po» is the orbital period. For small values of Nian (i.e., long 
periods), Aone deviates from this simple dependence on P,, because the noise properties 
of the full flux time series are not well represented by such sparse sampling. 
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Figure 2. Example planet detection metrics for a target impacted by the loss of Module 3 
(i.e., every fourth quarter of data is missing). This target corresponds to KIC 11446443, 
the host of TrES-2b (O’Donovan et al. 2006). Left: Window function as a function of 
orbital period. Right: One-sigma depth function as a function of orbital period. The top 
(bottom) line corresponds to the (shortest) longest transit duration searched for transit 
signatures. 
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Figure 3. Example planet detection metrics for a target with data missing every other 
quarter. This target corresponds to KIC 8745553. In general, targets with significant 
amounts of missing data or patterns in the missing data segments have increased structure 
in the planet detection metrics as data coherence rapidly changes with orbital period. 
Left: Window function as a function of orbital period. Right: One-sigma depth function 
as a function of orbital period. The top (bottom) line corresponds to the (shortest) longest 
transit duration searched for transit signatures. 
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WARNING: Note that the FITS binary tables for the one-sigma depth function contain 
not-a-number (NAN) entries for orbital periods where there is insufficient data for the 
minimum three-transit-event detection requirement (i.e., the window function goes to 
zero). In order to prevent these NAN entries from propagating further, we recommend 
that end-users replace them with a suitably high number (e.g., 10° ppm) immediately after 
reading in the files such that relative flux decrements of 100% cannot make a significant 
detection. The target KIC 874553, shown in Figure 3, is impacted by such NAN entries 
as illustrated by the gap in the one-sigma depth function data over the 400-600 day 
orbital period range (right panel) where the window function has zero detectability (left 
panel). 
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4. Window Function Changes and Verification 


In this section, we briefly describe the method used to calculate the window functions 
and their verification using an alternative Monte-Carlo sampling method. For Kepler, the 
window function requires evaluating, for a given ephemeris (i.e., period and epoch) 
whether enough data exists to observe three independent transit events and whether the 
in-transit data passes the three-transit weight check described in Section 2. The results 
depend on the details of the TPS version employed. Twicken et al. (2016) and Jenkins et 
al. (2017) describe the SOC 9.3 pipeline employed for DR25. 


In principle, the window function could be computed by TPS during the actual transit 
search within the pipeline. However, in earlier versions, the requisite information was 
not retained between evaluations at different ephemerides. Therefore, the signal search 
function in TPS was augmented post facto to accumulate this information across all 
ephemeris evaluations. With the penalty of a few storage variables, the window function 
was then calculated outside the formal pipeline with this modified version of TPS. 


In parallel with this effort, we developed a Monte-Carlo transit injection and recovery test 
for TPS known as flux-level transit injection (FLTI). This FLTI uses a highly scaled- 
down version of TPS that only searches the ephemeris grid points enclosing the injected 
ephemeris location, rather than the entire period grid of a normal search. In brief, this 
FLTI code adds realistic limb-darkened transit signals to the flux time series that was 
previously quarter-stitched and detrended during the full TPS run (see Thompson 2016). 
The FLTI is based on TPS alone and is completely independent of the other pipeline 
modules (Jenkins et al. 2017). Hence, this is very different (and many orders of 
magnitude faster) than the pixel-level transit injection study of Christiansen et al. (2016), 
which injects transit signals into the calibrated (CAL) pixel-level data and runs the 
Kepler pipeline to perform the photometry (PA), systematic error correction (PDC), full 
transit search (TPS), and signal evaluation (DV). These shortcuts allow us to test the 
recovery of hundreds of thousands of realistic transit signatures injected at different 
ephemerides into the light curve of a single star. In this case, the FLTI code is used to 
independently calculate an empirical window function that can be used to verify the 
stand-alone, augmented TPS module’s window function calculation. This verification is 
only practical for a small set of stars because FLTI is orders of magnitude more costly 
due to the large number of injections required to infer the window function. However, 
once verified, the augmented TPS code can readily calculate accurate window functions 
for all Kepler targets. 


Figure 4 shows the results of the initial verification exercise, comparing the FLTI and 
augmented TPS window functions for four different stars. The initial verification 
identified differences as large as 50% between the FLTI window functions (black lines) 
and the initial augmented TPS window functions (red lines). This discrepancy was traced 
to the fact that the three-transit weight check was missing from the augmented TPS 
window function code (i.e., the code only accounted for the three-transit requirement). 
With the addition of an extra array to retain the results of the weight checks for three- 
transit events, the final augmented TPS window functions for DR25 (blue lines) are 
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consistent with the empirical window functions (black lines) computed with FLTI to 
within the sampling errors of these Monte-Carlo results. 


As a cautionary note, the three-transit weight check was also missing from the augmented 
TPS code used to produce the DR24 window functions (Burke & Seader 2016). At that 
time, the FLTI code was not available for verification, so this discrepancy was not 
identified. Consequently, the archival window functions for DR24 are overly optimistic 
representations of the actual SOC 9.2 pipeline performance. End users of the DR24 
window functions are encouraged to consider how this discrepancy might impact their 


scientific results. 
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Figure 4. Example window functions of representative targets with varying amounts of 
observational data. The target’s Kepler ID, data span, and duty cycle are specified along 
the top. The previous window function delivery (red line) for DR24 (SOC 9.2) is overly 
optimistic compared to the empirically derived window function (black line) computed 
with FLTI using a Monte-Carlo technique. The current window function delivery 
(DR25; SOC 9.3 — blue line) is in agreement with the empirical window function after 
including the three-transit weight check. 
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Appendix A: Column Definitions for Related Products 


In addition to the window and one-sigma depth functions described in this document, the 
computation of pipeline completeness contours requires a number of other parameters 
(see Burke & Catanzarite 2017). Since these parameters are required for every observed 
star in a selected sample, they are included in the DR 25 Stellar Table at the NExSclI 
Exoplanet Archive. Their column definitions are grouped under the heading “Occurrence 
Rate Columns” on the “Data Columns in the Kepler Stellar Page” at the Archive and are 
reproduced here for the convenience of our users. 


Database ray 
Colina Nave Table Label Description 
The fraction of data cadences within the span of observations that contain 
dutycycle Duty Cycle valid data and contribute toward detection of transit signals. The values for 
Duty Cycle ranges from 0.0 and 1.0. 
The duty cycle calculated after the transit search is complete and all transit 
Duty Cycle Post signals have been removed from the light curve. This metric is useful in 


dutycycle_post 


Planet Removal 


identifying targets where large amounts of data were removed as a result of 
multiple planet search iterations. 


dataspan 


Data Span [days] 


The time elapsed in days between the first and last cadences containing 
valid data. 


The data span calculated after the transit search is complete and all transit 


Dale Span Est signals have been removed from the light curve. This metric is useful in 
dataspan_post | Planet Removal : bas 
davai identifying targets where large amounts of data were removed as a result of 
y multiple planet search iterations. 
Multiple Event 


Statistic (MES) 


mesthresOlp5 | threshold for a 
searched transit 
duration of 1.5 hours 

mesthres02p0 MES threshold: 2.0 
hours 

mesthres02p5 MES threshold: 2.5 
hours 

mesthres03p0 MES threshold: 3.0 
hours 

mesthres03p5 MES threshold: 3.5 
hours 

mesthres04p5 MES threshold: 4.5 
hours 

mesthres05p0 MES threshold: 5.0 
hours 

mesthres06p0 MES threshold: 6.0 
hours 

mesthresO7p5 | MES threshold: 7.5 


The Multiple Event Statistic (MES) threshold reports the transit-signal 
significance level achieved by the transiting planet search (TPS) module. 


There is one entry for each of the 14 transit durations (1.5, 2.0, 2.5, 3.0, 3.5, 
4.5,5.0,6.0, 7.5, 9.0, 10.5, 12.0, 12.5, 15.0 hr) explored by TPS in the 
transit search. 


A value of 7.1 indicates that TPS reached the nominal search threshold 
significance, whereas a value above 7.1 indicates that TPS ended the search 
prematurely at the higher specified significance threshold. 


15 of 18 


KSCI-19101-002: Window & One-Sigma Depth Functions 


05/03/2017 


hours 
mesthres09p0 MES threshold: 9.0 
hours 
mesthres10p5 MES threshold: 10.5 
hours 
mesthres12p0 MES threshold: 12.0 
hours 
mesthres12p5 MES threshold: 12.5 
hours 
mesthres15p0 MES threshold: 15.0 
hours 
Robust RMS CDPP 
mmscdpp01p5_ | for a searched transit 
duration of 1.5 hours 
Robust RMS CDPP: 
mmscdpp02p0 2.0 hours 
Robust RMS CDPP: 
mmscdpp02p5 5 Hours 
Robust RMS CDPP: } , ; ‘ 
mmscdpp03p0 | 9 hours The robust root-mean-square (RMS) combined differential photometric 
: precision (CDPP) is an empirical estimate of the noise in the relative flux 
rrmscdpp03p5 Robust RMS CDPP: time series observations. TPS computes a non-stationary time series of 
3.5 hours CDPP, which sets the significance level of detected transit signals. 
mmscdpp04p5 Rope su MS GREE: F ‘ ; ae . 
4.5 hours The 14 entries are the CDPP time series summary statistics for 14 transit 
. |durations (i.e., 1.5,2.0, 2.5, 3.0,3.5,4.5,5.0, 6.0, 7.5, 9.0, 10.5, 12.0, 12.5 
Robust RMS CDPP: ’ ? ? ? ’ ? ? ? ? ’ ’ ? ? ’ 
rrmscdpp05p0 5.0 hours 15.0 hr) searched by TPS. These summary statistics are calculated in a 
robust fashion by excluding cadences de-weighted during the transit search 
rrmscdpp06p0 Robust RMS CDPP: | and employing median and mean absolute deviation algorithms, rather than 
6.0 hours the less robust arithmetic mean and standard deviation. Specifically, the 
Robust RMS CDPP: | Standard definition of the root mean square deviation of a data set X is 
rrmscdpp07p5 7.5 hours Xrms’= Xmean+Xstd’, where Xmean is the arithmetic mean of X and 
Xstd is the standard deviation of X. For the robust version, the arithmetic 
rrmscdpp09p0 a pre COE mean is replaced with the median and the standard deviation is replaced 
eta eal with the mean absolute deviation (mad) scaled to approximate the standard 
tacanplons Robust RMS CDPP: | deviation, so that (Xrobust_rms)* = (Xmedian)’ + (1.4826*Xmad)’. 
10.5 hours 
Robust RMS CDPP: 
pisedpp tape 12.0 hours 
Robust RMS CDPP: 
mised ppl ep2 12.5 hours 
Robust RMS CDPP: 
mmisedpp lope 15.0 hours 
BeCDE slope HOt The rmsCDPP slope for long transit durations is a linear fit to the 
cdppslplong Long Transit 


Durations 


log10(transit duration) versus log10(rmsCDPP) values for the six longest 
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transit durations (e.g. 7.5, 9.0, 10.5, 12.0, 12.5, 15.0 hr). 


Note: For white Gaussian noise, the rmsCDPP is expected to decrease with 
increasing transit duration because the noise decreases as the square-root of 
the number of in-transit cadences. However, targets with astrophysical 
noise or non-uniform power spectral densities can have rmsCDPP values 
that actually increase with increasing transit duration. Hence, this rmsCDPP 
slope serves as an indicator of non-white Gaussian noise behavior on the 
longest timescales searched. Monte-Carlo transit injection and recovery 
tests of the Kepler pipeline reveal that targets with large values of this 
rmsCDPP slope have suppressed transit recovery relative to targets with 
nominal rmsCDPP slopes. 


rmsCDPP Slope for 


See the definition of rmsCDPP slope for long transit duration. This metric 
is determined by fitting the slope of the second to sixth shortest transit 
durations (e.g. 2.0, 2.5, 3.0, 3.5, 4.5 hr) in order to characterize the noise on 


edepepshnt shout ean short timescales. This fit differs from the long transit duration slope in that 
Duration en : : : : 
it is performed in a semi-log space (e.g. log 10(transit duration) versus 
rmsCDPP). 
Timeout Indicator for 
timeoutOlp5 —_| a searched transit 
duration of 1.5 hours 
: Timeout Indicator: 
timeout02p0 5.0 houre 
: Timeout Indicator: 
timeout02p5 25 hours 
re {030 Timeout Indicator: 
pai aaad 3.0 hours The timeout indicators report whether the TPS planet search completed 
P ; : normally for the 14 transit durations. The searches in the TPS proceed from 
: Timeout Indicator: : : hee 
timeout03p5 the longest transit duration to the shortest. The searches at individual 
3.5 hours : ; : 
durations end when their allocated share of resources is consumed and the 
timeout04p5 Timeout Indicator: entire search process ends when an overall timeout is reached. 
4.5 hours 
: Timeout Indicator: Timeout indicator values: 
timeout05p0 
5.0 hours 
' Timeout Indicator: * 0 (zero) = the search ended normally after reaching its nominal 
timeout06p0 6.0 hours significance threshold of 7.1; 
; : ; ¢ 1 =the search ended prematurely, before the 7.1 significance 
: Timeout Indicator: : 
timeout07p5 threshold was achieved; 
7.5 hours : 
¢  2=no transit search was attempted. 
timeout09p0 Timeout Indicator: 
P 9.0 hours 
: Timeout Indicator: 
timeout 10p5 10.5 hours 
. Timeout Indicator: 
timeout 12p0 12.0 hours 
, Timeout Indicator: 
timeout 12p5 12.5 hours 
timeoutl5p0 —_| Timeout Indicator: 
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15.0 hours 


timeoutsumry 


Timeout Indicator 
Summary 


This is a binary flag where a value of | indicates that all 14 transit durations 
were searched down to the 7.1 significance threshold with no timeout 
issues, and a value of 0 (zero) indicates that at least one of the transit 
durations did not achieve a search down to the 7.1 significance threshold. 
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